4 |
Structured generative models for unsupervised named-entity clustering
BASE
14 |
Coarse-to-fine n-best parsing and MaxEnt discriminative reranking
BASE
17 |
Sentence-Internal Prosody Does not Help Parsing the Way Punctuation Does
BASE
18 |
BLLIP 1987-89 WSJ Corpus Release 1
Abstract:
*Introduction* Brown Laboratory for Linguistic Information Processing (BLLIP) 1987-89 WSJ Corpus Release 1 contains a complete, Treebank-style part-of-speech (POS) tagged and parsed version of the three-year Wall Street Journal (WSJ) collection from ACL/DCI (LDC93T1), approximately 30 million words. The annotation was performed using statistically based methods developed by BLLIP researchers Eugene Charniak, Don Blaheta, Niyu Ge, Keith Hall, John Hale, and Mark Johnson. This corpus both overlaps and supplements the million-word Penn Treebank (PTB) collection of parsed and POS-tagged WSJ texts. *Data* The PTB project selected 2,499 stories from a three-year WSJ collection of 98,732 stories for syntactic annotation. These 2,499 stories are distributed in Treebank-2 (LDC95T7) and Treebank-3 (LDC99T42), both of which include the raw text for each story. *Updates* There are no updates at this time.
URL: https://catalog.ldc.upenn.edu/LDC2000T43
BASE
19 |
Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction ...
BASE